91 research outputs found

    Exact distribution of a pattern in a set of random sequences generated by a Markov source: applications to biological data

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>In bioinformatics it is common to search for a pattern of interest in a potentially large set of rather short sequences (upstream gene regions, proteins, exons, etc.). Although many methodological approaches allow practitioners to compute the distribution of a pattern count in a random sequence generated by a Markov source, no specific developments have taken into account the counting of occurrences in a set of independent sequences. We aim to address this problem by deriving efficient approaches and algorithms to perform these computations both for low and high complexity patterns in the framework of homogeneous or heterogeneous Markov models.</p> <p>Results</p> <p>The latest advances in the field allowed us to use a technique of optimal Markov chain embedding based on deterministic finite automata to introduce three innovative algorithms. Algorithm 1 is the only one able to deal with heterogeneous models. It also permits to avoid any product of convolution of the pattern distribution in individual sequences. When working with homogeneous models, Algorithm 2 yields a dramatic reduction in the complexity by taking advantage of previous computations to obtain moment generating functions efficiently. In the particular case of low or moderate complexity patterns, Algorithm 3 exploits power computation and binary decomposition to further reduce the time complexity to a logarithmic scale. All these algorithms and their relative interest in comparison with existing ones were then tested and discussed on a toy-example and three biological data sets: structural patterns in protein loop structures, PROSITE signatures in a bacterial proteome, and transcription factors in upstream gene regions. On these data sets, we also compared our exact approaches to the tempting approximation that consists in concatenating the sequences in the data set into a single sequence.</p> <p>Conclusions</p> <p>Our algorithms prove to be effective and able to handle real data sets with multiple sequences, as well as biological patterns of interest, even when the latter display a high complexity (PROSITE signatures for example). In addition, these exact algorithms allow us to avoid the edge effect observed under the single sequence approximation, which leads to erroneous results, especially when the marginal distribution of the model displays a slow convergence toward the stationary distribution. We end up with a discussion on our method and on its potential improvements.</p

    Broad Spectrum Pro-Quorum-Sensing Molecules as Inhibitors of Virulence in Vibrios

    Get PDF
    Quorum sensing (QS) is a bacterial cell-cell communication process that relies on the production and detection of extracellular signal molecules called autoinducers. QS allows bacteria to perform collective activities. Vibrio cholerae, a pathogen that causes an acute disease, uses QS to repress virulence factor production and biofilm formation. Thus, molecules that activate QS in V. cholerae have the potential to control pathogenicity in this globally important bacterium. Using a whole-cell high-throughput screen, we identified eleven molecules that activate V. cholerae QS: eight molecules are receptor agonists and three molecules are antagonists of LuxO, the central NtrC-type response regulator that controls the global V. cholerae QS cascade. The LuxO inhibitors act by an uncompetitive mechanism by binding to the pre-formed LuxO-ATP complex to inhibit ATP hydrolysis. Genetic analyses suggest that the inhibitors bind in close proximity to the Walker B motif. The inhibitors display broad-spectrum capability in activation of QS in Vibrio species that employ LuxO. To the best of our knowledge, these are the first molecules identified that inhibit the ATPase activity of a NtrC-type response regulator. Our discovery supports the idea that exploiting pro-QS molecules is a promising strategy for the development of novel anti-infectives

    CONSIDERATIONS FOR AN INNOVATIVE HIGH TEMPERATURE BATTERY IN POWERPLANT APPLICATIONS

    Get PDF
    A novel high temperature battery based on the concept of the solid oxide fuel cell (SOFC) is presented. Due to the use of cheap iron- and calcium-based storage materials providing a high theoretical capacity of roughly 1300 Wh/kg the battery could be used to optimize the part load properties and the long term durability of conventional power plants. The elevated working temperature of 800 °C makes it applicable where high quality heat is available and needed. In this paper an economical consideration leads to general design recommendations for this battery which operates in test mode with current densities of 150 mA/cm2 and approximately one hour charging/discharging time at cell voltages between 0.7-1.2 V
    • …
    corecore